Which Is a More Accurate Predictor in Colorectal Survival Analysis? Nine Data Mining Algorithms vs. the TNM Staging System

نویسندگان

  • Peng Gao
  • Xin Zhou
  • Zhen-ning Wang
  • Yong-xi Song
  • Lin-lin Tong
  • Ying-ying Xu
  • Zhen-yu Yue
  • Hui-mian Xu
چکیده

OBJECTIVE Over the past decades, many studies have used data mining technology to predict the 5-year survival rate of colorectal cancer, but there have been few reports that compared multiple data mining algorithms to the TNM classification of malignant tumors (TNM) staging system using a dataset in which the training and testing data were from different sources. Here we compared nine data mining algorithms to the TNM staging system for colorectal survival analysis. METHODS Two different datasets were used: 1) the National Cancer Institute's Surveillance, Epidemiology, and End Results dataset; and 2) the dataset from a single Chinese institution. An optimization and prediction system based on nine data mining algorithms as well as two variable selection methods was implemented. The TNM staging system was based on the 7(th) edition of the American Joint Committee on Cancer TNM staging system. RESULTS When the training and testing data were from the same sources, all algorithms had slight advantages over the TNM staging system in predictive accuracy. When the data were from different sources, only four algorithms (logistic regression, general regression neural network, bayesian networks, and Naïve Bayes) had slight advantages over the TNM staging system. Also, there was no significant differences among all the algorithms (p>0.05). CONCLUSIONS The TNM staging system is simple and practical at present, and data mining methods are not accurate enough to replace the TNM staging system for colorectal cancer survival prediction. Furthermore, there were no significant differences in the predictive accuracy of all the algorithms when the data were from different sources. Building a larger dataset that includes more variables may be important for furthering predictive accuracy.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Artificial neural networks improve the accuracy of cancer survival prediction.

BACKGROUND The TNM staging system originated as a response to the need for an accurate, consistent, universal cancer outcome prediction system. Since the TNM staging system was introduced in the 1950s, new prognostic factors have been identified and new methods for integrating prognostic factors have been developed. This study compares the prediction accuracy of the TNM staging system with that...

متن کامل

Designing a System for Trend Analysis of Users in Website Surfing in Iran Using Data Mining and Text Mining Algorithms

Background and Aim: As of the entrance of web surfing to the lifestyle of a vast majority of people in the society and the need for a more accurate social and cultural policy making in the field, authors intended to analyze the behavior of the society users in viewing different websites so as to help politicians and practitioners. Methods: Design science research method is used in this research...

متن کامل

QUALITY IMPR TNM Staging of Colorectal Cancer Should be Reconsidered According to Weighting of the T Stage Verification Based on a 25-Year Follow-Up

The gradient monotonicity of existing tumor, node, metastases staging systems for colorectal cancer is unsatisfactory. Our proposed T-plus staging system strengthens weighting of the T stage. In this study, applicability of the T-plus staging system was verified with data of a Chinese colorectal cancer center. Records of 2080 nonmetastatic, advanced cancer patients undergoing colorectal cancer ...

متن کامل

Prognostic significance of preoperative prognostic nutritional index in colorectal cancer: results from a retrospective cohort study and a meta-analysis

The preoperative prognostic nutritional index (PNI) may forecast colorectal cancer (CRC) outcomes, but the evidence is not conclusive. Here, we retrospectively analyzed a cohort of patients from the Department of Surgical Oncology at the First Hospital of China Medical University (CMU-SO). We also conducted a meta-analysis of eleven cohort studies. Bayesian Information Criterion (BIC) was used ...

متن کامل

A New Knowledge-Based System for Diagnosis of Breast Cancer by a combination of the Affinity Propagation and Firefly Algorithms

Breast cancer has become a widespread disease around the world in young women. Expert systems, developed by data mining techniques, are valuable tools in diagnosis of breast cancer and can help physicians for decision making process. This paper presents a new hybrid data mining approach to classify two groups of breast cancer patients (malignant and benign). The proposed approach, AP-AMBFA, con...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره 7  شماره 

صفحات  -

تاریخ انتشار 2012